A Maximum-Entropy Partial Parser for Unrestricted Text
Abstract
This paper describes a partial parser that assigns syntactic structures to sequences of part-of-speech tags. The program uses the maximum entropy parameter estimation method, which allows a flexible combination of different knowledge sources: the hierarchical structure, parts of speech and phrasal categories. In effect, the parser goes beyond simple bracketing and recognises even fairly complex structures. We give accuracy figures for different applications of the parser.
Related resources
APOLN: A Partial Parser Of Unrestricted Text
In this paper, we present APOLN (Analizador Parcial de Oraciones en Lenguaje Natural): a partial parser of unrestricted natural language sentences based on finite-state techniques. Partial parsing has been used in several applications: syntactic parsing of unrestricted texts, data extraction systems, machine translation, solving the attachment ambiguity, speech recognition systems, text summari...
A Block-Based Robust Dependency Parser For Unrestricted Chinese Text
Although substantial efforts have been made to parse Chinese, very few have been practically used due to incapability of handling unrestricted texts. This paper realizes a practical system for Chinese parsing by using a hybrid model of phrase structure partial parsing and dependency parsing. This system showed good performance and high robustness in parsing unrestricted texts and has been appli...
A Maximum Entropy Chinese Character-Based Parser
The paper presents a maximum entropy Chinese character-based parser trained on the Chinese Treebank (“CTB” henceforth). Word-based parse trees in CTB are first converted into character-based trees, where word-level part-of-speech (POS) tags become constituent labels and character-level tags are derived from word-level POS tags. A maximum entropy parser is then trained on the character-based corpu...
Using a maximum entropy-based tagger to improve a very fast vine parser
In this short paper, an off-the-shelf maximum entropy-based POS-tagger is used as a partial parser to improve the accuracy of an extremely fast linear time dependency parser that provides state-of-the-art results in multilingual unlabeled POS sequence parsing.
A maximum entropy shallow functional parser for spoken language understanding
In this paper we investigate a maximum entropy approach to spoken language understanding. We compare this approach with a parser based on finite-state transducers. The parsers are evaluated on a corpus of utterances modelling human-computer interactions within a single domain. The corpus was annotated with task-oriented semantic categories to obtain a set of shallow functional parse trees. We f...
Journal title:
- CoRR
Volume cmp-lg/9807006, Issue -
Pages -
Publication date: 1998